29 research outputs found

CentralNet: a Multilayer Approach for Multimodal Fusion

Author: A Dhall
D Lahat
M Kang
N Neverova
PK Atrey
S Chandar
S Escalera
Y LeCun
Z Gu
Publication date: 22/08/2018

This paper proposes a novel multimodal fusion approach that aims to produce the best possible decisions by integrating information from multiple media. Most past multimodal approaches work either by projecting the features of different modalities into the same space or by coordinating the representations of each modality through constraints; our approach borrows from both. More specifically, assuming each modality can be processed by a separate deep convolutional network, so that decisions can be taken independently from each modality, we introduce a central network linking the modality-specific networks. This central network not only provides a common feature embedding but also regularizes the modality-specific networks through multi-task learning. The proposed approach is validated on four different computer vision tasks, on which it consistently improves the accuracy of existing multimodal fusion approaches.

arXiv.org e-Print Archive

HAL - Normandie Université

Crossref

Accessibility-based reranking in multimedia search engines

Author: Anastasios Drosou
Dimitrios Tzovaras
DS Friedman
EM Fine
F Liu
H Brettel
H Hirvelä
H Kim
I Kalamaras
Ilias Kalamaras
IY Kim
J Liu
J Sang
JR Lavery
KW-T Leung
L Zhang
M Wang
Nikolaos Dimitriou
NJ Belkin
PK Atrey
S Lawrence
S Tajima
S Yang
T-L Ji
Y Nikulin
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/08/2016

Traditional multimedia search engines retrieve results based mostly on the query submitted by the user, or use a log of previous searches to provide personalized results, without considering the accessibility of the results for users with vision or other types of impairments. In this paper, a novel approach is presented which incorporates the accessibility of images for users with various vision impairments, such as color blindness, cataract and glaucoma, in order to rerank the results of an image search engine. The accessibility of individual images is measured through the use of vision simulation filters. Multi-objective optimization techniques utilizing the image accessibility scores are used to handle users with multiple vision impairments, while the impairment profile of a specific user is used to select one of the Pareto-optimal solutions. The proposed approach has been tested with two image datasets, using both simulated and real impaired users, and the results verify its applicability. Although the proposed method has been used for vision accessibility-based reranking, it can also be extended to other types of personalization context.
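The selection step described in this abstract (compute Pareto-optimal solutions over several accessibility objectives, then use a user's impairment profile to pick one) can be illustrated with a minimal sketch. The score vectors, the "higher is better" convention, and the weighted-sum selection rule are assumptions made for illustration; the abstract does not state how the profile is applied, and the function names are hypothetical.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is at least as good as b on every objective and strictly
    better on at least one (assumption: higher score = more accessible)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(scores: List[Sequence[float]]) -> List[int]:
    """Indices of candidates that no other candidate dominates."""
    return [
        i
        for i, s in enumerate(scores)
        if not any(dominates(t, s) for j, t in enumerate(scores) if j != i)
    ]


def pick_for_profile(scores: List[Sequence[float]], profile: Sequence[float]) -> int:
    """Choose one Pareto-optimal candidate using a per-impairment weight
    vector (hypothetical stand-in for the user's impairment profile)."""
    front = pareto_front(scores)
    return max(front, key=lambda i: sum(w * x for w, x in zip(profile, scores[i])))


# Each tuple: (accessibility under impairment A, accessibility under impairment B)
candidates = [(0.9, 0.2), (0.5, 0.5), (0.4, 0.4), (0.2, 0.9)]
front = pareto_front(candidates)
choice_a = pick_for_profile(candidates, (0.8, 0.2))  # user mostly affected by A
choice_b = pick_for_profile(candidates, (0.1, 0.9))  # user mostly affected by B
```

Candidate `(0.4, 0.4)` is dominated by `(0.5, 0.5)` and so never reaches the front; the two profiles then select different solutions from the remaining three.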

Crossref

Springer - Publisher Connector

Spiral - Imperial College Digital Repository

Investigating non-classical correlations between decision fused multi-modal documents

Author: A Aspect
A Pathak
A Tversky
AM Gleason
BS Cirel’son
CJ Rijsbergen Van
D Aerts
JF Clauser
M Grubinger
Massimo Melucci
N Gisin
PD Bruza
PK Atrey
T Baltrušaitis
T Veloz
Y Hou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/10/2018

Correlation has been widely used to facilitate various information retrieval methods such as query expansion, relevance feedback, document clustering, and multi-modal fusion. In particular, correlation and independence are important issues when fusing different modalities that influence a multi-modal information retrieval process. The basic idea of correlation is that one observable can help predict or enhance another. In quantum mechanics, quantum correlation, called entanglement, is a kind of correlation between observables measured on atomic-size particles when these particles are not necessarily collected in ensembles. In this paper, we examine a multimodal fusion scenario that might be similar to that encountered in physics by first measuring two observables (i.e., text-based relevance and image-based relevance) of a multi-modal document without relying on an ensemble of multi-modal documents already labeled in terms of these two variables. Then, we investigate the existence of non-classical correlations between pairs of multi-modal documents. Although there are some basic differences between entanglement and the classical correlation encountered in the macroscopic world, we investigate the existence of this kind of non-classical correlation through violation of the Bell inequality. We experimentally test several novel association methods in a small-scale experiment; however, in the current experiment we did not find any violation of the Bell inequality. Finally, we present a series of discussions that may provide theoretical and empirical insights for future development of this direction.
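A Bell-inequality test of the kind mentioned in this abstract can be illustrated with the CHSH form, which is one common variant; the abstract does not say which variant the authors used. In the sketch below, the ±1 outcome encoding, the function names, and the sample values are all assumptions for illustration only.

```python
from typing import List, Tuple


def expectation(pairs: List[Tuple[int, int]]) -> float:
    """Mean product of paired +1/-1 outcomes for one pair of measurement settings."""
    return sum(x * y for x, y in pairs) / len(pairs)


def chsh(e_ab: float, e_ab2: float, e_a2b: float, e_a2b2: float) -> float:
    """CHSH statistic S = E(a,b) + E(a,b') + E(a',b) - E(a',b')."""
    return e_ab + e_ab2 + e_a2b - e_a2b2


def violates_bell(s: float) -> bool:
    """Classical (local hidden variable) models satisfy |S| <= 2."""
    return abs(s) > 2.0


# Hypothetical outcomes for one setting pair, e.g. +1 = judged relevant, -1 = not
sample = [(1, 1), (1, -1), (-1, -1), (1, 1)]
e = expectation(sample)
s_classical = chsh(0.5, 0.5, 0.5, 0.5)
```

A value of `S` within `[-2, 2]`, as with `s_classical` here, is consistent with classical correlation, which matches the outcome the abstract reports.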

arXiv.org e-Print Archive

Crossref

Open Research Online (The Open University)

iCLAP: Shape Recognition by Combining Proprioception and Touch Sensing

Author: A Drimus
A Spiers
AM Okamura
EM Petriu
H Zhang
Hongbin Liu
J Bimbo
JL Bentley
Kaspar Althoefer
LA Torres-Mendez
M Charlebois
M Johnsson
M Meier
N Sommer
PK Atrey
R Ibrayev
RS Dahiya
RS Fearing
S Khan
S Luo
Shan Luo
SJ Lederman
Wenxuan Mou
YB Jia
Z Kappassov
Z Liu
Z Pezzementi
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/06/2018

The work presented in this paper was partially supported by the Engineering and Physical Sciences Research Council (EPSRC) Grant (Ref: EP/N020421/1) and the King’s-China Scholarship Council Ph.D. scholarship.

arXiv.org e-Print Archive

University of Liverpool Repository

Crossref

Queen Mary Research Online

King's Research Portal

Improved Multimodal Emotion Recognition for Better Game-Based Learning

Author: B Reeves
F Anaraki
G Lang
JJG Merrienboer Van
JP Gee
JR Landis
K Bahreini
N Sebe
P Ekman
P Petta
PJ Hager
PK Atrey
R Pekrun
RJ Nadolski
S Bashyal
S Hrastinski
S Kelle
TM Connolly
Z Zeng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015

Open University of the Netherlands Research Portal

Crossref

Learning a priori constrained weighted majority votes

Author: Amaury Habrard
Aurélien Bellet
D Haussler
D Kedem
Emilie Morvant
F Laviolette
G Lever
KQ Weinberger
L Breiman
M Marchand
Marc Sebban
PK Atrey
R Nock
R Schapire
S Floyd
S Sun
T Graepel
Publication venue: 'Springer Science and Business Media LLC'

Crossref

SecureCSearch: Secure Searching in PDF over Untrusted Cloud Servers

Author: Atrey PK
Mohanty M
Shah MD
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 29/05/2020

© 2019 IEEE. The use of the cloud for data storage has become ubiquitous. To prevent data leakage and hacks, it is common to encrypt the data (e.g. PDF files) before sending it to a cloud. However, encryption makes it hard to search the encrypted cloud data for files containing specific keywords. The traditional method is to download all files from the cloud, store them locally, decrypt them and then search over them, which defeats the purpose of using a cloud. In this paper, we propose a method, called SecureCSearch, to perform keyword search operations on encrypted PDF files in the cloud in an efficient manner. The proposed method makes use of Shamir's Secret Sharing scheme in a novel way to create encrypted shares of the PDF file and of the keyword to search. We show that the proposed method maintains the security of the data and incurs minimal computation cost.

OPUS - University of Technology Sydney

Secret sharing approach for securing cloud-based pre-classification volume ray-casting

Author: Atrey PK
Mohanty M
Ooi WT
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/08/2022

With the evolution of cloud computing, cloud-based volume rendering, which outsources data rendering tasks to cloud datacenters, is attracting interest. Although this new rendering technique has many advantages, allowing third-party access to potentially sensitive volume data raises security and privacy concerns. In this paper, we address these concerns for cloud-based pre-classification volume ray-casting by using Shamir’s (k, n) secret sharing and its variant (l, k, n) ramp secret sharing, which are homomorphic to addition and scalar multiplication operations, to hide color information of volume data/images in datacenters. To address the incompatibility of the modular prime operation used in secret sharing with the floating-point operations of ray-casting, we consider either excluding the modular prime operation from secret sharing or converting the floating-point operations of ray-casting to fixed-point operations; the former technique degrades security and the latter degrades image quality. Both of these techniques, however, result in significant data overhead. To lessen the overhead at the cost of high security, we propose a modified ramp secret sharing scheme that uses the three color components in one secret sharing polynomial and replaces the shares in floating point with smaller integers.
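The homomorphic property this abstract relies on can be seen in a minimal sketch of textbook Shamir (k, n) secret sharing over a prime field. This is the standard scheme, not the modified ramp scheme the paper proposes; the prime, the function names, and the seeded `random.Random` generator are assumptions for a repeatable illustration (a real deployment would draw coefficients from a cryptographically secure source).

```python
import random
from typing import List, Tuple

PRIME = 2**31 - 1  # a Mersenne prime; all arithmetic is done modulo this value

Share = Tuple[int, int]


def make_shares(secret: int, k: int, n: int, rng: random.Random) -> List[Share]:
    """Split `secret` into n shares; any k of them reconstruct it."""
    # Random polynomial of degree k-1 whose constant term is the secret
    coeffs = [secret % PRIME] + [rng.randrange(PRIME) for _ in range(k - 1)]

    def evaluate(x: int) -> int:
        acc = 0
        for c in reversed(coeffs):  # Horner's rule
            acc = (acc * x + c) % PRIME
        return acc

    return [(x, evaluate(x)) for x in range(1, n + 1)]


def reconstruct(shares: List[Share]) -> int:
    """Lagrange interpolation at x = 0 recovers the constant term."""
    secret = 0
    for i, (xi, yi) in enumerate(shares):
        num, den = 1, 1
        for j, (xj, _) in enumerate(shares):
            if i != j:
                num = (num * -xj) % PRIME
                den = (den * (xi - xj)) % PRIME
        # Modular inverse via Fermat's little theorem (PRIME is prime)
        secret = (secret + yi * num * pow(den, PRIME - 2, PRIME)) % PRIME
    return secret


rng = random.Random(0)  # NOT cryptographically secure; demo only
shares_a = make_shares(120, k=3, n=5, rng=rng)
shares_b = make_shares(75, k=3, n=5, rng=rng)

# Addition and scalar multiplication applied share-by-share, without the secrets
summed = [(x, (ya + yb) % PRIME) for (x, ya), (_, yb) in zip(shares_a, shares_b)]
scaled = [(x, (2 * y) % PRIME) for x, y in shares_a]
```

Reconstructing from any three of the `summed` shares yields the sum of the two secrets, and from the `scaled` shares twice the first secret, which is why a datacenter can carry out additions and scalar multiplications on shares without seeing the underlying values.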

OPUS - University of Technology Sydney

A design methodology for selecting and placement of sensors in multimedia surveillance systems

Author: Atrey PK
Kankanhalli MS
Ramakrishnan KR
Singh VK
Siva Ram GSVS
Publication venue: ACM Press

This paper addresses the problem of how to select the optimal number of sensors and how to determine their placement in a given monitored area for multimedia surveillance systems. We propose to solve this problem by deriving a novel performance metric, expressed as the probability of accomplishing the task as a function of the set of sensors and their placement. This measure is then used to find the optimal set. The same measure can be used to analyze the degradation in the system's performance when various sensors fail. We also build a surveillance system using the optimal set of sensors obtained with the proposed design methodology. Experimental results show the effectiveness of the proposed design methodology in selecting the optimal set of sensors and their placement.

Open Access Repository of IISc Research Publications

Learning A Priori Constrained Weighted Majority Votes

Author: Amaury Habrard
Aurélien Bellet
D Haussler
D Kedem
Emilie Morvant
F Laviolette
G Lever
KQ Weinberger
L Breiman
M Marchand
Marc Sebban
PK Atrey
R Nock
R Schapire
S Floyd
S Sun
T Graepel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

The published version is available here: http://link.springer.com/article/10.1007/s10994-014-5462-z

Weighted majority votes allow one to combine the output of several classifiers or voters. MinCq is a recent algorithm for optimizing the weight of each voter based on the minimization of a theoretical bound over the risk of the vote with elegant PAC-Bayesian generalization guarantees. However, while it has demonstrated good performance when combining weak classifiers, MinCq cannot make use of the useful a priori knowledge that one may have when using a mixture of weak and strong voters. In this paper, we propose P-MinCq, an extension of MinCq that can incorporate such knowledge in the form of a constraint over the distribution of the weights, along with general proofs of convergence that hold in the sample compression setting for data-dependent voters. The approach is applied to a vote of k-NN classifiers with a specific modeling of the voters' performance. P-MinCq significantly outperforms the classic k-NN classifier, a symmetric NN and MinCq using the same voters. We show that it is also competitive with LMNN, a popular metric learning algorithm, and that combining both approaches further reduces the error.
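As a minimal illustration of what a weighted majority vote computes, the sketch below combines binary voter outputs in {-1, +1} under a given weight distribution. It does not implement MinCq or P-MinCq, whose weight optimization is not described in the abstract; the function name, the tie-breaking rule, and the example weights are assumptions.

```python
from typing import Sequence


def weighted_majority_vote(votes: Sequence[int], weights: Sequence[float]) -> int:
    """Return the sign of the weighted sum of voter outputs (+1 or -1).

    Ties are broken in favour of +1 (an arbitrary choice for this sketch)."""
    total = sum(w * v for w, v in zip(weights, votes))
    return 1 if total >= 0 else -1


votes = [1, -1, -1]  # one voter says +1, two say -1

# Uniform weights: the two -1 voters win
uniform = weighted_majority_vote(votes, [1 / 3, 1 / 3, 1 / 3])

# More weight on the first voter (e.g. known a priori to be stronger) flips the result
informed = weighted_majority_vote(votes, [0.6, 0.2, 0.2])
```

The contrast between `uniform` and `informed` shows why prior knowledge about voter strength, expressed as a constraint on the weights, can change the outcome of the vote.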

HAL-UJM

Crossref

IST Austria: PubRep (Institute of Science and Technology)

Hal-Diderot